Phase diagram of a-helical and /3-sheet forming peptides 
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The intrinsic property of proteins to form structural motifs such as a-hehces and /3-sheets leads to 
a complex phase behaviour in which proteins can assemble into various types of aggregates including 
crystals, liquid-like phases of unfolded or natively folded proteins, and amyloid fibrils. Here we use a 
coarse-grained protein model that enables us to perform Monte Carlo simulations for determining the 
phase diagram of natively folded a-helical and unfolded /3-sheet forming peptides. The simulations 
reveal the existence of various metastable peptide phases. The liquid-like phases are metastable 
with respect to the fibrillar phases, and there is a hierarchy of metastability. 



In a diagram of protein solution (e.g., refs[TH4]), the 
solubility line specifies the conditions under which a pro- 
tein crystal neither grows nor dissolves and corresponds 
to the gas-crystal coexistence line in a diagram of atomic 
system. Similarly, the gas-liquid coexistence line for an 
atomic system corresponds to the liquid-liquid separa- 
tion line for a protein solution (the latter is the line 
of coexistence of two separate liquid phases, a protein- 
rich and a solvent-rich ones). The phase behavior de- 
scribed in refs[TJ 121 14] involves native proteins, with lit- 
tle or no difference in the protein conformation in the 
different phases. However, depending on solvent con- 
ditions and physical variables such as temperature, the 
proteins may also adopt non-native conformations, re- 
sulting in the formation of various aggregates. Indeed, a 
wide range of different proteins unrelated in their amino 
acid sequence have been shown to form amyloid fibrils 
which share a common characteristic cross- /3 structure 
where peptides form /3-sheets oriented parallel to the fib- 
ril axis^ 6 . Despite the importance of the effect of pro- 
tein conformational changes on the protein phase dia- 
gram, we are aware of only one experimental diagram 
involving this effect. It is for Human /3Bl-crystallin and 
describes the coexistence lines between a solution and 
fibrillar aggregates and between these aggregates and a 
liquid-like phase. On the other hand, attempts have also 
been made [71 18] to numerically construct phase diagrams 
of peptides by determining the ground state structure of 
the condensed peptide phases. The resulting diagrams 
however do not account for the existence of metastable 
phases that are crucial to the understanding of generic 
aspects of the protein phase behavior. 

The present study makes use of a novel theoretical 
framework [9 in which proteins are described as flexi- 
ble tubes. In the model used here and described in de- 
tail elsewhere [10 the protein backbone is represented by 
a chain with finite thickness. The directional hy- 
drogen bonding is sequence-independent and can be ac- 
counted for by an analysis of the geometrical properties 
of hydrogen-bond forming atoms in protein structures 
listed in the Protein Data Bank. The hydrogen-bond en- 
ergy is denoted by e. The hydrophobic effect between Cq, 
atoms is captured by a pairwise attractive square- well po- 
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FIG. 1: Gallery of peptide aggregates, (a) a-oligomer at 
dimensionless temperature = T / Tf = 0.9, (b) and (c) 7- 
oligomer at ^ = 1.0 and 1.1, respectively, (d) 1/3-sheet at 
= 1.1, (e) 2/3-sheet at 6* = 1.2, and (f) 3/3-sheet at 6* = 1.2. 
Peptides in the solution are shown in yellow, peptides within 
an aggregate that do not form interpeptide hydrogen bonds 
are shown in blue, and those that do so are shown in brown, 
green or red. 



tential with energy ey^. As in previous studies [TO^, TP, we 
use e/e/i = 20 in order to quantify the relative strength of 
the hydrogen and the hydrophobicity-mediated bondings. 
Steric constraints are implemented by a local bending 
stiffness with energy per atom. In all our simula- 
tions the stiffness energy is set equal to 0.3e. Peptides 
that are weakly hydrophobic, and hence have such a large 
e/ch value, have been demonstrated [12 to assemble into 
the various types of aggregates considered here. We in- 
vestigated the phase behavior of a simple prototype of 
biomolecular system consisting of 12-residue homopep- 
tides in an implicit aqueous solution. The simulations 
showed that most of the peptides in the solution fold at 
least partially into a native a-helical structure at temper- 
atures below the folding temperature Tf = 0.20e/A:, and 
unfold at least partially into an extended random-coil 
structure above it. Our simulation model is thus per- 
tinent to peptides with hydrogen-bond energy e = (1.9 
to 2.5) X 10~^^ J, because these e values correspond to 
Tf = 276 to 363 K, i.e. to peptide folding temperatures 
of biophysical relevance. 

Our simulations showed that the peptides can self- 
assemble into different types of aggregates: a-oligomers - 
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disordered aggregates constituted solely of fully folded 
a-tielical peptides (Fig. [l^), 7-oligomers - also disor- 
dered aggregates but consisting of fully folded, partially 
folded and unfolded peptides (Figs. [T]3 andjl]^), and i^- 
sheets - ordered aggregates with i = 1,2,3,... layers 
of single /3-slieet tapes (Figs. respectively). Im- 

portantly, as seen in Figs. [TJd and [T]^, the fraction of 
folded/unfolded peptides in the 7-oligomers depends on 
temperature. We shall consider the above aggregates as 
representing distinct phases, because the relative contri- 
bution of the hydrophobicity-mediated and the hydro- 
gen bondings that stabilize them is different (Fig. [2|. 
While an a-oligomer is stabilized by the hydrophobicity- 
mediated bonding only, the 7-oligomer is additionally 
stabilized by the hydrogen bonding (by definition, a 7- 
oligomer is completely free of i/3-sheets). In contrast, the 
z/3-sheets are predominantly stabilized by the hydrogen 
bonding. On average (Fig. [2|, for the number h of inter- 
peptide hydrogen bonds per peptide we have 6 = for 
the a-oligomers, 6 = 1 or 3 for the 7-oligomers at dimen- 
sionless temperature = T / Tj = 1.0 or 1.1, respectively, 
and b = 18 for all i/3-sheets. Also on average (Fig.|2|, the 
number bh of hydrophobicity-mediated bonds per pep- 
tide is 155, 78, 124 and 134 for the a-oligomer, 1/3-sheet, 
2/3-sheet and 3/3-sheet, respectively. For the 7-oligomer, 
bh = 132 at l9 = 1.0 and 1.1. 

In order to determine the solubility lines for the a- 
oligomer, 7-oligomer and i/3-sheets with i = 1,2,3 we 
followed the approach used by Bai and Li[13 . We per- 
formed Monte Carlo simulations in the canonical ensem- 
ble using crankshaft, pivot, reptation, rotation and trans- 
lation moves. The simulations were carried out by using 
a cubic box and periodic boundary conditions. For the a- 
oligomer solubility determination we prepared a cluster 
of a fixed number n of fully folded peptides and placed 
this n-sized cluster into a solution containing also a fixed 
number (100 or 200) of such peptides. Isothermally, we 
performed several simulation runs at different peptide 
concentrations and monitored whether the cluster grew 
or shrank. As in this case the cluster was an a-oligomer, 
we constrained the peptides in the system to remain fully 
folded by performing only rotation and translation moves 
during the simulation. This enabled us to calculate the 
a-oligomer solubility at temperatures at which the pep- 
tides would normally start to unfold. From the first sim- 
ulation runs we identified two near peptide concentra- 
tions between which the a-oligomer would coexist with 
the solution. This provided a concentration range for 
additional runs which yielded more accurately the equi- 
librium concentration (or solubility) Ce,n at which an 
n-sized a-oligomer neither grows nor shrinks. As cluster 
solubility is known to depend on cluster size, we studied 
the dependence of Ce,n on ^ at a given by repeating 
our simulations for five a-oligomer sizes: n = 100, 200, 
300, 500 and 700. The results obtained are shown in 
Fig. [3^. The size dependence of Ce,n can be described 
by the Ostwald formula (e.g., ref.[14 ) 
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FIG. 2: Bonding in peptide aggregates, (a) Probability distri- 
bution function of the number b of hydrogen bonds per pep- 
tide in: a-oligomer (black line) at ^ = 0.7, 7-oligomer (orange 
line) at ^ = 1.0, 7-oligomer (brown line) at = 1.1, 1/3-sheet 
(blue line) at = 1.1, 2/3-sheet (green line) at ^ = 1.2, and 
3/3-sheet (red line) at ^ = 1.2. (b) Probability distribution 
function of the number bh of hydrophobicity-mediated bonds 
per peptide in: a-oligomer (black line), 7-oligomers (orange 
and brown lines), l^^-sheet (blue line), 2;5-sheet (green line), 
and 3/3-sheet (red line) at the respective values noted above. 




FIG. 3: a-oligomer results, (a) Temperature dependence of 
solubility: squares, open down triangles, diamonds, up trian- 
gles and circles - simulation data for cluster size n = 100, 200, 
300, 500 and 700, respectively; black trianlges - derived data 
for n = 00 from best-fit analysis; lines - best fit of equation [2] 
(b) Size dependence of solubility: squares, open down trian- 
gles, diamonds, up triangles and circles - simulation data at 
= 0.9, 0.85, 0.8, 0.75, 0.7, respectively; lines - best fit of 
equation [1] (c) Dependence of parameter a (circles) and spe- 
cific surface energy a (squares) on 0; the straight lines are 
drawn to guide the eye. (d) Solubility of bulk a-oligomeric 
phase: triangles - data from (a) for n = 00; line - best fit of 
equation [2] 



In Ce,n = In Ce + a/Ofi^^^ 



(1) 
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in which Ce is the equihbrium concentration (or 
solubihty) of the infinitely large a-oligomer, and 
a = 2cav'^^^ /3kTf. Here a is the specific surface energy 
of the a-oligomer/solution interface, v is the volume oc- 
cupied by a peptide in a-oligomer, and for a spherical 
a-oligomer the shape factor c = (367r)^/^. In In (7e,n-vs- 
coordinates, a linear fit of equation [l] to our data 
(Fig. ^) yielded values for In Ce (from the intercept) at 
each temperature 6>, and hence the solubility Ce{0) of the 
infinitely large a-oligomer (the down triangles in Fig. [3|i 
and in Fig. [4]). Also, the slope of the fit provided the 
a value at each 0. The resulting a{6) dependence (the 
circles in Fig. ^) implies that the a-oligomer surface ten- 
sion a decreases with temperature as illustrated by the 
squares in Fig. These squares represent the a val- 
ues calculated from the above expression for a with the 
aid of c = (367r)^/^ (spherical a-oligomers), Tf = 300 K 
(peptides with e = 2.07 x 10"^^ J) and v = 1.2 nm^. The 
latter is the mid-range value of the peptide volume for 6 
between 0.7 and 0.9, and was obtained by dividing the 
volume of all peptides in the bulk of an a-oligomer by the 
number of these peptides. As seen in Fig. [S]^, a = 4.5 to 
6.1 mJ/m^ and is in the range of 0.1 to 30 mJ/m^, val- 
ues reported both theoretically and experimentally for 
the specific surface energy of protein crystals in aqueous 
solutions [l [151 [E]. 

Fitting the integrated van't Hoff equation 



Ce = Crexp{-X/0) 



(2) 



to the Ce{0) data in Fig. [sji enables determining the 
dimensionless latent heat X = L/kTj of peptide aggre- 
gation into an infinitely large a-oligomer. Here Cr is 
a practically temperature-independent reference peptide 
concentration, and L is the latent heat of peptide ag- 
gregation into such an oligomer. The best-fit result 
is Cr = 5.2 X 10^ mM and A = 15.1 ± 0.3. With 
Tf = 0.2e/k it thus follows that L = XkTf has the value 
of 3e. Hence, recalling that in our simulations e = 20eh 
and that on average bh = 155, we find that the latent 
heat L = dOe^ of peptide aggregation into a-oligomer is 
somewhat lower (by the factor 0.78) than half of the av- 
erage binding energy bh^h = 155e/j, that a fully folded 
peptide has in the bulk a-oligomeric phase. For com- 
parison, in the Haas-Drenth lattice model [T^ of protein 
crystals the latent heat of crystallisation is just half the 
average protein binding energy. Also, with a = 4.0 to 5.4 
(see Fig. [sj:)), A = 15.1 and c = (367r)i/^, for the ratio 
GV^I'^ jL = 3a/2cX we obtain values from 0.08 to 0.11 in 
the range studied. Thus, av'^^^/L for the a-oligomers is 
considerably smaller than for atomic or simple molecular 
substances which are known to follow the Stefan- Skapski- 
Turnbull relation av^l'^jL = 0.2 - 0.6 (e.g., Ref. [M]). 

Next in our study was the determination of the 7- 
oligomer solubility. Following the procedure outlined for 
the a-oligomers, we prepared a cluster of size n = 100. 
As the 7-oligomer consists of fully folded, partially folded 
and unfolded peptides, their proportion in it was taken to 
be the same as that of the peptides in the solution at the 
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FIG. 4: Peptide phase diagram. Solubility-vs-temperature 
data for a-oligomer (down triangles) of infinitely large size, 7- 
oligomer (circles) of size n = 100, 1/3-sheet (solid diamonds), 
2/3-sheet (up triangles) and 3/3-sheet (right triangles), all of 
size n = 40. Lines a, 1/3, 2/3 and 3/3 are obtained by best fit 
of equation [2] to the data; line 7 is just a guide to the eye; 
labels a-OL, 7-OL, 1, 2, 3, and S refer to different stability 
regions (see main text); the open diamonds refer to n = 20. 



chosen temperature. The simulations, in which we per- 
formed pivot, crankshaft, reptation, rotation and transla- 
tion moves, revealed that the 7-oligomer was a transient 
formation always transforming into an i/3-sheet. For that 
reason it was necessary to use a biasing potential that 
arrested the 7-oligomer transformation by not allowing 
a peptide to form more than one interpeptide hydrogen 
bond with any other peptide in the cluster. The simula- 
tion Ce^nifi) data for 7-oligomers of size n = 100 (the cir- 
cles in Fig. [4| show that the 7-oligomer solubility cannot 
be described by equation [2] The non-monotonic behav- 
ior of the 7-oligomer solubility is a result of the tempera- 
ture dependence of the proportion of folded and unfolded 
peptides in the oligomer. This effect is strongest for be- 
tween 0.9 and 1.1, the temperature range in which the 
peptides commence unfolding and forming additional in- 
terpeptide hydrogen bonds that decrease the 7-oligomer 
solubility. Partial unfolding of proteins is regarded as be- 
ing a crucial step in protein aggregation pTt 118] and our 
observation of the decreased 7-oligomer solubility with 
the peptide unfolding rationalizes this view. Regrettably, 
as the simulations with 7-oligomers were computationally 
much more demanding than those with the a-oligomers, 
we could not investigate the effect of the cluster size on 
the 7-oligomer solubility. 

Finally, we determined the solubility lines for the 1/3- 
, 2/3- and 3/3-sheets. In these simulations all i/3-sheets 
consisted of n = 40 unfolded peptides. Since i/3-sheet 
formation at a fixed i value is a one-dimensional clus- 
tering process, the i /3-sheet solubility is expected to be 
n-independent. We could not confirm this expectation 
for all i = 1,2 and 3, but simulations with 1/3-sheets of 
size n = 20 revealed that these sheets indeed had the 
solubility of those with n = 40 (see the diamonds in 
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Fig. [4| . The simulations showed also that a new peptide 
monolayer could occasionally form on the i /3-sheet stud- 
ied, transforming it into an (i + 1) /3-sheet. In order to 
calculate the solubility of these metastable i/3-sheets we 
had to include a biasing potential preventing this trans- 
formation from occurring. The so-obtained simulation 
Ce,n(^) data are shown in Fig. 4] by symbols. A fit of 
equation [2] to these data provides values for the refer- 
ence peptide concentration and latent heat of peptide 
aggregation into an sheet: Cr = 2.1 x 10^^ mM and 
A = 60 ± 6 for the 1/3-sheet, Cr = 4.3 x 10^"^ mM and 
A = 66 ± 4 for the 2/3-sheet, and Cr = 1.2 x 10'^^ mM 
and A = 66 ±3 for the 3/3-sheet. With the Tf value given 
above, it thus follows that L = XkTf has the values of 
12e, 13. 2e and 13. 2e for the 1/3-, 2/3- and 3/3-sheet, re- 
spectively. Again, these L values can be compared with 
the average binding energy be -\- bh^h = 22e, 24e and 24e 
that an unfolded peptide has in the bulk 1/3-, 2/3- and 3/3- 
sheet, respectively. These numbers follow from the 6, bh 
and e/ch values already given above. We see that, similar 
to the a-oligomer, and in close correspondence with lat- 
tice models, for the i/3-sheets L is about half the peptide 
average binding energy. We note that the L value for 
the 3/3-sheet may be regarded as representative for that 
of the infinitely thick /3-sheet, because in our simulation 
model the hydrophobicity-mediated interaction is limited 
within two successive peptide monolayers only. This lim- 
itation is refiected by the virtually equal average binding 
energies of a peptide in the 2/3- and 3/3-sheets. Also, the 
L values for the i/3-sheets are considerably greater than 
those for the a- and 7-oligomers because of the presence 
of more hydrogen bonds per peptide in the i/3-sheets. 

In conclusion, the phase diagram in Fig. 4 shows that 
there are two regions of thermodynamic stability, one of 
the peptide solution (region S), and one of the infinitely 
thick /3-sheet. The dividing line between these two stable 
phases is approximately the solubility line of the 3/3-sheet 
(line 3/3), because in our model the peptide interactions 
are restricted within two successive /3-sheets. In addition 
to this, the phase diagram reveals the hierarchic exis- 
tence of various metastable peptide phases. In region 3 
the 3/3-sheet is stable with respect to the solution, but 
metastable with respect to thicker i/3-sheets {i > 3). In 
region 2 the 2/3-sheet is stable with respect to the solu- 
tion, but metastable with respect to all thicker i/3-sheets 
(i > 2). In region 1 the 1/3-sheet is stable with respect 
to the solution, but metastable with respect to all other 
z/3-sheets {i > 1). In region 7-OL the 7-oligomer is also 
stable with respect to the solution, but metastable with 



respect to all i^^-sheets. Finally, in region a-OL, though 
being stable with respect to the solution, the a-oligomer 
is metastable with respect to all other aggregates. A 
similar hierarchy in the stability of protein aggregates 
has been observed in an experimentally obtained phase 
diagram of Human /3Bl-crystallin[3 . The high stability 
of the i/3-sheets arises from the fact that their structure 
is largely due to the preponderant presence of hydrogen 
bonds between the peptides[19 . Our finding of the ex- 
istence of various metastable peptide phases and of hi- 
erarchy in their metastability provides a solid basis for 
describing the formation of amyloid fibrils and for un- 
derstanding why these fibrils can grow out of disordered 
peptide aggregates pT| |2Q] . 
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